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We study a simple model of information propagation in social networks, where two quantities are introduced: 
the spread factor, which measures the average maximal fraction of neighbors of a given node that interchange 
information among each other, and the spreading time needed for the information to reach such fraction of 
nodes. When the information refers to a particular node at which both quantities are measured, the model can 
be taken as a model for gossip propagation. In this context, we apply the model to real empirical networks 
of social acquaintances and compare the underlying spreading dynamics with different types of scale-free and 
small-world networks. We find that the number of friendship connections strongly influences the probability of 
being gossiped. Finally, we discuss how the spread factor is able to be applied to other situations. 
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I. INTRODUCTION AND MODEL 

In every-days life probably everyone has already experi- 
enced the annoying situation of telling some personal secret 
to some friend and ending with a naive "please, do not tell 
that to anyone, ok?" and after short time all our friends sud- 
denly know the secret. What happened? Is this common 
phenomenon a consequence of a natural instinct that friends 
have to conspire and slander against each other? Or is this a 
phenomenon which can hardly be avoid by human trust and 
respect being closely related to the net of acquaintances that 
people naturally tend to form? 

Such kind of questions can be easily addressed by repre- 
senting the social system, composed by individuals and the 
interactions among them, as a network, i.e., as a collection of 
nodes and links. While networks have been widely used by 
physicists to studye.g. porous media ijTt] or a system of inter- 
acting spins bSHIj they can also be used to study social sys- 
tems. Social networks have helped to further understand the 
structure and evolution of social systems, where people and 
their acquaintances are represented by the nodes and links of 
the network respectively. In particular, propagation of infor- 
mation in social systems is easily reproduced in such networks 
and has been addressed in recent physical literature (HI 0, 01 
due to its importance in epidemiology where information 
is related to the contagious of diseases, to understand social 
influence, beliefs and extremism |To , [TTl [T2I1 , to under- 
stand the evolution of financial markets psl] , to study econo- 
physical networks underlying e.g., electrical supply systems 
or road webs among airports or cities. Here we put empha- 
sizes on how far the information can spread when particular 
constraints, of interest for social systems, are taken into ac- 
count. 

The way information spreads over the network depends on 
its content. A rumour or an opinion concerning some topic 
which is not directly connected to the social network structure 
(political opinion, etc) can be of interest to any of the neigh- 




FIG. 1 : (Color online) Spreading of information about a target-node 
shown as the grey (red) open circle on part of a real school friendship 
network (TUl . If the spreading starts from one of the white squared 
neighbors, no propagation occurs (/ = 0). If instead, one of the 
grey (yellow) squared neighbors starts the spreading, in r = 3 time- 
steps, five neighbors will know it, giving / = 5/7. The information 
spreads over the dashed (blue) lines. The information can be seen 
as a gossip about the target-node or the victim (see text). Note that 
the clustering coefficient of the victim has a different value, namely 
C = 10/42. 



bors of a certain node, regardless their topological features. 
However, as opposed to rumors, a gossip always targets the 
details about the behavior or private life of a specific person, 
i.e., of a specific node. This node will be called henceforth 
the target-node or the victim. Therefore, due to this particu- 
lar content, it is reasonable to assume as a first approach that 
the information spreads only over people directly connected 
to the victim. 
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A simple model recently introduced ifTill for such kind of 
information spreading is described as follows. Selecting ran- 
domly a victim, the gossip about him or her is created at 
time t = by an originator which shares a bond with the 
victim. At i = 1 the originator only spreads the gossip to 
other nodes, which are connected to him-/herself and the vic- 
tim. The spread continues until all reachable acquaintances of 
the victim know it, as illustrated by the squares connected by 
dashed lines in Fig.[T]for a real friendship network ifTsll . Our 
dynamics is therefore like a burning algorithm lfl6ll . starting 
at the originator but limited to sites that are neighbors of the 
victim. 

To measure how effectively the gossip - or, in general, the 
information - attains the acquaintances of the victim, we de- 
fine the spreading factor as / = nf/k, where n/ is the to- 
tal number of people who eventually hear the gossip and k is 
the degree of the the victim. In addition, we also define the 
spreading time t which defines the minimum time it takes to 
reach this fraction / of acquaintances, giving a measure of 
how far these connected acquaintances are from each other It 
is important to note that / and the standard definition of clus- 
tering coefficient C UtI [isll are different quantities, since the 
later only measures the number of bonds between neighbors 
and contains no information about how such bonds distribute 
among the victim's acquaintances. 

We start in Sec. |II]by studying how such kind of infor- 
mation spreads in different networks, namely in scale-free 
and in small-world networks. Some analytical considerations 
will be present for the particular case of the Apollonian net- 
work IJJ]. The results of such artificial networks are also 
compared to the ones obtained with an empirical network of 
social contacts recently obtained from an U.S. School sur- 
vey iflSll. where friendship acquaintances were rigorously de- 
fined lll5l[l9ll . There are also situations where the information 
about the target-node can be of interest beyond the first neigh- 
bors, like the case where the victim is a movie star, yielding 
a scenario similar to the one of usual rumour propagation or 
even epidemic spreading 1I20I1 . These cases will be consid- 
ered in SecHn] Since the tendency for spreading information 
does not always implies that its transmission will be certain, 
we introduce in Sec.|IV]a probability for each node to spread 
the information and study the main effects on the spreading 
dynamics. Discussion and conclusions are given in SecIV] 



II. SPREADING INFORMATION OVER FIRST 
NEIGHBORS 

We consider first a Barabasi-Albert (BA) scale-free net- 
work |i2ll|: starting with a small number m of nodes fully 
connected to each other one adds iteratively one new node 
with 771 initial links attached to the nodes of the network with 
a probability proportional to the node degree. 

In Fig.|2^ we show the average spreading time r as a func- 
tion of the degree fc in a scale-free network with = 10"' 
nodes and m ^ 3,5 and 7. In all cases, for large values of k, 
T scales logarithmically with the degree 
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FIG. 2: (a) Spreading time r in a Barabasi-Albert scale-free network 
and (b) the spreading factor /, both as a function of fc: m = 3 
(circles), m = 5 (squares) and m = 7 (triangles). The dashed line 
in (b) indicates / = The inset in (b) is a zoom of the plot 

for 771 = 5 emphasizing the optimal degree ko which minimizes the 
gossip spreading (see text). In all cases, iV = 10" nodes, averages 
over 500 realizations are considered, and logarithmic binning in k is 
used. 



where for this case A = —10.77 and B = 2.433 defines the 
dashed line in Fig. [2^. 

For the same values of 777 we plot in Fig. |2}3 the depen- 
dence of the spread factor with the degree. Curiously, one 
sees an optimal degree fco for which the spreading factor at- 
tains a minimum (see inset). This optimal value lies typically 
in the middle range of the degree spectrum showing that the 
two extreme situations of having either few or many neighbors 
enhance the relative broadness of the information spreading. 
Further, a closer look shows that for small degrees the values 
of / coincide with / = 1/fc (dashed line) while for larger de- 
grees / deviates from 1/fc with a deviation which increases 
with m. Thus, while initially (t = 0) the spread factor is al- 
ways / = 1/fc (dashed line), for the subsequent time-steps 
one observes that nodes with small degrees remain on average 
at / = 1/fc while for large degrees the spread factor increases 
up to a maximal value. 

The dependence of the optimal value fco on the two param- 
eters N and 777 is studied in Fig. [3] Here, we observe that the 
optimal degree fc-Q yields approximately 
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The scale-free networks considered above are probabihstic. 
In other contexts, deterministic scale-free networks have been 
proposed Uj, {22j\, as a way to construct perfect hierarchical 
networks. One of such networks is the Apollonian network. 
The Apollonian network is constructed in a purely determin- 
istic way lfll l23ll as illustrated in Fig.|4^: one starts with three 
interconnected nodes, defining a triangle; at 77 = (gener- 
ation 0) one inserts a new node at the center of the triangle 
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FIG. 3: The optimal degree ko in a BA network as a function (a) 
of N fixing 771 = 5 initial outgoing connections and (b) of m for 
N = 10'' nodes. The average degree is (k) = 2m. The dotted lines 
have slopes of a = 4.64 and ~b = —1.34 (see Eq. lO). 




FIG. 4: (a) Illustration of the first three generations of an Apollonian 
network (see text), (b) Spreading time r for the spreading factor 
to attain the maximal value f ~ 1 where the dashed line can be 
obtained analytically (see text) yielding an expression as in Eq. ([T) 
with A = -0.28 and B = 1.1. 



and joins it to the three other nodes (white circles in Fig.|4^), 
thus defining three new smaller triangles; at iteration n = 1 
one adds at the center of each of these three triangles a new 
node (squares), connected to the three vertices of the triangle, 
defining nine new triangles and then for generation n = 2 one 
node (black circles) at the center of each of these nine trian- 
gles and henceforth. The number of nodes and the number of 
connections are given respectively by iV„ = i(3"+^ + 5) 
and Ln = |(3"+^ + 1). The distribution of connections 
obeys a power-law, since the number of nodes with degree 
/c = 3, 3 • 2, 3 ■ 22, . . . , 3 ■ 2"-\ 3 • 2" and 2"+^ is equal to 
3", 3"^^, 3"~2, . . . , 3^, 3, 1 and 3, respectively. Thus one has 
P{k)(xk-^ with 7 = In 3/ In 2. 

One main difference from the BA network is that, for Apol- 
lonian networks / = 1 independently of k, due to the hier- 
archical structure shown in Fig. |4^. In Fig|4j5 one observes 
the logarithmic behavior of t similar to the BA case. In the 



FIG. 5: Propagation of information on a real friendship network of 
American students (Til averaged over 84 schools. In (a) we show 
the spreading time r as function of degree k, plotting in the inset, 
the average degree of neighbors of nodes with degree k. In (b) 
the spread factor /, both as a function of degree k, with the inset 
showing the degree distribution P{k). 



Apollonian case the logarithmic behavior can even be derived 
analytically as follows. From Fig. one sees that vertices 
belonging to the nth generation communicate with each other 
through n steps thus t (x n. Since the degree of the nth gen- 
eration is given by |[ll| fc = 3 x 2"^^, one obtains the logarith- 
mic dependence of r shown in Fig. |4j;, where the dashed line 
yields the expression in Eq. ([T]i with A = —0.28 and B = 1.1. 

Next, we show that the main results obtained for the scale- 
free networks above are also characteristic of real empirical 
social networks. For that, we study the model for informa- 
tion propagation on a real social network, namely, the one 
extracted from empirical data obtained in an extensive study 
done within the National Longitudinal Study of Adolescent 
Health (AddHealth) IH] at the Carolina Population Center 
The data comprehends a survey done between 1994 and 1995 
in 84 American schools evaluating an in-school questionnaire 
to 90118 students. The students are separated by the school 
they belong to and therefore there are 84 networks with sizes 
ranging from ~ 100 to ^ 2000 students. The aim is to al- 
low social network researchers interested in general structural 
properties of friendship networks to study the structural and 
topological properties of social networks ll24ll . In previous 
studies iflgj I25I1 . it has been shown that the main properties 
characterizing the underlying networks from these data can 
be easily reproduced with a mobile agent model. 

As shown in Fig. |5^, while for small k the spreading time 
grows linearly, for large k it follows a logarithmic law given 
by Eq. ([TJ with A = -2.84 and B = 1.98. Here, the log- 
arithmic growth of T with k follows the same dependence of 
the average degree fc„„ of the nearest neighbors ll26ll . as il- 
lustrated in the inset of Fig. [5^. Further, the non-trivial effect 
of having an optimal degree fco is also observed in Fig. |5}). 
For these schools one obtains fco 7 neighbors as an optimal 
value for which / ^ 0.42, meaning that less than half of the 



first neighbors are readied. In other words, with less friends 
[k < fco), the information is more able to reach a larger frac- 
tion of them. But, contrary to intuition, the same occurs for 
the nodes having a larger number of friends. 

Interestingly, information spreads in the same way either 
through these empirical networks as on scale-free networks, 
although the corresponding topological and statistical features 
are known to be quite distinct [ 19, 25]. For instance, as shown 
in the inset of Fig. |5}5, the degree distribution P{k) of the 
school networks is typically exponential and not power-law. 
Since the same optimal degree appears in BA networks, one 
argues that the existence of this optimal number is not nec- 
essarily related to the degree distribution of the network, but 
rather to the degree correlations. However, the relation be- 
tween degree correlations, measured by fc„„, and the loga- 
rithmic behavior of the spreading time is not straightforward. 
While in the empirical network we find the same distribution 
for both knn and t, in BA and APL networks fc„„ follows 
a power-law with k. In the case of uncorrected networks, 
two and three-point correlations reduce to simple expressions 
of the moments of the degree distribution. Therefore, / is 
independent of the degree, similarly to what is observed for 
the density of particles as derived by Catanzaro et al jZTl l in 
diffusion-annihilation processes on complex networks. 

To go further with the characterization of information 
spreading on networks, we next study the distributions, P{t) 
and P{f). In Fig. |6h we see that for the Apollonian network 
the distribution P{t) of the spreading time decays exponen- 
tially. This behavior can be understood if we consider that 
P{T)dT = P{k)dk and use Eq. ([TJ together with the degree 
distribution, P{k) oc k~'^ , to obtain 
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for large k. The slope in Fig.|6^ is precisely (1 — 7)/i3 = 
—0.17 using B = 1.1 from Fig. |4}; and 7 = 2.58 from 
Ref. H. 

For the school network P(r) follows an exponential decay 
for large t, as shown in Fig.|6}5, and has a maximum for small 
T. For comparison, we also plot in Fig. |6j5 the distribution 
P{t) for the BA network with m = 9, which has a very sim- 
ilar shape but is shifted to the right, due to the larger minimal 
number of connections. In both cases, the distribution is well 
fitted by an exponential. The reason for the similiarities be- 
tween empirical networks and BA networks at the particular 
value m = 9 may be related to the way the questionnaire was 
made at the schools: each student should name their friends 
out of a maximal number of 10 acquaintances. From the simi- 
larities we could now argue that in fact on average the students 
elected 9 acquaintances each. 

Figure|6}; shows the distribution P{f) for a scale-free BA 
network, while Fig. |6]i shows the same distribution for the 
empirical networks. Before studying such distributions the 
following remarks should be taken into account. The spread- 
ing factor depends on the number k of neighbors and con- 
sequently depends also on the network size, since the larger 
the network the larger the maximal number k^ax of neigh- 
bors a node may have. Furher, the spread factor varies al- 
ways between the minimal value and the maximal value 
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FIG. 6: Distribution P{t) of spreading times r for (a) the Apollo- 
nian network of 8 generations, and (b) the real school network (cir- 
cles) and the BA network with m = 9 and A'^ = 1000 (solid line). 
The dashed lines indicate the best fit to the data for large r values 
of Eq. ([3}, with parameters (1 — 7)/B = —0.45 and —1.26 for the 
APL in (a) and the real school network in (b), respectively. Below, 
the distributions of / are shown for (c) the BA network with the same 
parameter values (inset magnifies the range / £ [0.4, 0.6]), for (d) 
the schools and for (e) an artificial distribution of all possible frac- 
tions / among the same number of nodes and neighbors. The highly 
positive skewness in P(f) of both BA and schools networks are in 
strong deviation with the artificial distribution, indicating a structure 
among the way neighbors connect with each other (see text). 



1 and for a given node with k neighbors the possible values 
are / = 0,1/ k, 2/fc, . . . , {k — 1)/ k,l. Consequently, if for 
a specific network all the possible /-values appear with the 
same probability one should expect the distribution P{f) to 
be symmetric around / = 1/2 with discrete peaks at n/k for 
n = 0,1, ... ,k and k = 1, . . . , kmax- This artificial distribu- 
tion is shown in Fig. |6^, obtained from all possible fractions 
constructed with all integers from = 1 to 1000. 

For BA networks, there is also a symmetry in the vicinity 
of / = 1/2 (Fig. |6h). However, different from an uniform 
distribution, one finds a strong asymmetry between small and 
large values of /: the most pronounced peaks are observed 
for / < 0.1. This same behavior is observed for the em- 
pirical school networks, as shown in Fig. |6}l, which is also 
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FIG. 7: (a) Propagation of information in small- world networks: 
Spreading time r, clustering coefficient C /Co and spread factor / 
as a function of the logarithm of the rewiring probability p for the 
small-world lattice with A'^ = 10* sites. Co — 1/2 is the cluster- 
ing coefficient of a regular lattice. In all cases we average over 100 
configurations and fco = 4 (see text), (b) Dependence of the spread 
factor / on fc* = {k~ kmin) / (kmax — kmin), for the random graph 
with N = 10'^ sites and p = 0.02 (circles), 0.04 (squares) and 0.08 
(triangles). In the inset: the spreading time r of the random networks 
for the same parameter values. 



strongly asymmetric when compared with the corresponding 
uniform distribution of all possible values of / sketched in 
Fig B- The positive skewnesses indicate a higher frequency 
of low /-values than of larger ones, which indicates in fact 
that the neighbors of nodes tend to form small separated sets 
of linked neighbors. Consequently, one is able to address how 
the connections between neighbors are groupped only by mea- 
suring the spreading factor for the central node. For the dis- 
tribution P{f) of the Apollonian network one trivially finds 
P{f) — 6{1 — f) since the hierarchical structure of the net- 
work always yields / = 1, as mentioned before. 

Social networks are usually small-world ll28ll . i.e., they are 
characterized by a high clustering coefficient and a low av- 
erage shortest path length. Since we are interested in social 
systems we will next study the propagation of information on 
artificial small-world networks, constructed as follows ll28ll . 
One starts with a regular lattice where each node is attached to 
/co neighbors symmetrically displaced. Such regular network 
is characterized by a clustering coefficient Co and a shortest 
path length Lq. In this regular network, all links are short- 
range. Then, sweeping over all nodes one rewires with prob- 
ability p each link to a randomly chosen node. By doing this 
there will be on average pkoN/2 long-range links. 

For p = the network is a regular structure where no long- 
range links exist, yielding a large average path length and clus- 
tering coefficient. For p = 1 all links are long-range produc- 
ing a random graph structure where both average path length 
and clustering coefficient are small. Increasing p from to 
1, one first observes the decrease of the shortest path length 
L, when compared to Lq, and only for larger values of p the 



decrease of the clustering coefficient C, as shown in Fig.|7^. 
Therefore, in the middle range between the decrease of L and 
the decrease of C one obtains the small-world effect where 
L/Lq is small and C/Cq is large ll29tl . As shown in Fig.|7^ 
this range is approximately —2 < logp < —1. In Fig.|7h one 
also sees that both the spread factor / starts to decrease at ap- 
proximately the same value of p as the normalized clustering 
coefficient C/Cq . 

Figure |7}5 illustrates the variation of the spread factor as 
a function of the degree in the particular case of a random 
network. Instead of the above procedure with p = 1 fixed, 
random networks can also be constructed by starting with 
N nodes and introducing with probability p' one link be- 
tween each pair of nodes. Typically, in random networks 
there is a threshold pj, beyond which different structure and 
dynamical features appear This is also the case for gos- 
sip propagation. Figure [TJ? shows the behavior of / in ran- 
dom networks for three illustrative values of p' = 0.02, 0.04 
and 0.08, while the inset shows the coiTesponding spread- 
ing time. Since in random networks the average degree in- 
creases with p', we choose to compute / and t as functions of 
k* ~ (A:— fcmi„)/(fcma2;— /cmin) inordertofacilitatecompari- 
son. Forp' = 0.02 and lower values both the spread factor and 
spreading time remain approximately constant, with / ~ 1/fc 
and r ~ 1. Increasing the probability to p' = 0.04 increases 
the average degree per node and also the spread factor beyond 
its initial value f = 1/k, and consequently the corresponding 
spreading time, r > 1, increases with fc. Increasing even fur- 
ther the probability to p' = 0.08 and beyond, more and more 
connections are introduced throughout the network, in partic- 
ular among the neighbors of each node, which enables more 
nearest neighbors to know about the gossip. Consequently, 
on average one obtains /max = 1 independently of fc. This 
maximal value for such values of p' means that the spreading 
attains all the neighbors of the victim. Therefore one should 
expect that the time to reach complete spreading should de- 
crease with k*, which is what one observes in the inset of 

Fig. Eh. 

As a preliminary conclusion of this section one can state 
that, although different in their structure, empirical social net- 
works behave similarly to scale-free networks when subject 
to propagation of information over the first neighborhood of a 
particular target-node. 

III. BEYOND THE FIRST NEIGHBORS 

In this Section we will study how / and r change when 
the information is able to propagate beyond first neighbors. 
For that, we consider two different regimes of information 
spreading. In the first regime, it spreads among the first and 
second neighbors of the victim, and in the second it spreads 
throughout the entire network. For the latter, there are two 
other quantities of interest that we introduce here. One is the 
total fraction F^v of nodes who know and transmit the infor- 
mation, defined as 
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FIG. 8: Information or gossip propagation througli the first two 
neigfiborfioods in American schools: (a) Spreading time t as a func- 
tion of k and (b) the spread factor / as a function of k. As one sees 
the optimal number fco for which / attains a minimum decreases sig- 
nificantly compared with the previous situation (see text). 



where Ng is the maximal number of nodes in the entire net- 
work which already know the information and N is the total 
number of nodes. Second, the maximal spreading time Tmax 
defined as the number of time-steps necessary to attain the 
fraction F^r. 

Figure |8] shows the spreading dynamics in the American 
schools when it spreads among the two first neighborhoods 
of the victim. The behavior is significantly different from the 
one observed previously (compare with Fig. [Sj. From Fig. [8^ 
one sees that the spreading time becomes independent on k 
for large values deviating from the logarithmic dependence 
observed previously. 

As for the spread factor / shown in Fig. [Sj), one still ob- 
serves an optimal value minimizing the spreading of the gos- 
sip, but this value is now much lower than the one found for 
propagation only among common neighbors of the originator 
and the victim. Probably here, contrary to what happens in 
the previous case, the optimal value vanishes when the net- 
work size or the number of connections increase. This con- 
jecture will be reinforced next by studying artificial scale-free 
networks. 

As illustrated in Fig.|9]the same behavior observed for the 
schools is also observed for BA networks. Here, the results 
for three different BA networks are shown for m = 3 (cir- 
cles), TO = 5 (squares) and to = 7 (triangles). The spreading 
time T attains also a constant value independent on k for large 
/c-values (Fig.|9^). Obviously this plateau decreases with the 
minimal number m of connections and our simulations show 
that the dependence on m is approximately logarithmic for 
small values of k. This decrease happens because increasing 
TO increases the number of links per node, enabling a faster 
propagation. Moreover the maximal value to which t con- 
verges for large k can be explained as follows: since now the 
information spreads over first and second neighbors, if the net- 
work has poor /c-correlations, for sufficiently large k, all val- 



FIG. 9: Propagation of information among first and second neighbors 
of a BA scale-free network. Here one sees (a) the spreading time r as 
a function of the degree fc for m = 3 (circles), m = 5 (squares) and 
m — 7 (triangles), (b) Spread factor / for the same m = 3 (circles), 
m = 5 (squares) and m — 7 (triangles). Here = 10*, averages 
over 100 realizations were considered and logarithmic binning in k 
was used. 



ues of k start to be present within the two first neighborhoods 
yielding an independence of r on A;. The distribution of the 
spreading time presents also an approximatelly exponential 
tail with a slope that increases with to. 

As for the spread factor /, the optimal value ko is observed 
only for small to (to = 3) and rapidly vanishes when to is 
increased. In fact, for large values of to one finds large values 
of / decreasing with fc as / cx 1/fc. This occurs independently 
of TO. Due to the large values of /, the distribution P{f) has 
again a very pronounced peak at / = 1. 

While for these BA networks the results are quite differ- 
ent when the two first neighbors are considered instead of 
only nearest neighbors, the Apollonian network displays an 
almost invariant behavior for an Apollonian network almost 
the same behavior remains. The lack of sensibility to the in- 
crease of the neighborhood in Apollonian networks is a conse- 
quence of its hierarchical structure. Also for small-world and 
random networks similar results are obtained. So, as prelimi- 
nary conclusions one sees that in hierarchical networks and in 
networks with small-world property it does not matter if the 
information can be transmitted beyond the victim's acquain- 
tances or not: in one way or another everyone rapidly knows 
our secrets ! 

After seeing what happens in small neighborhoods, the next 
question refers to the opposite limit, i.e., when all nodes are 
able to get the information from the originator Of course 
in this case the fraction / almost always achieves eventually 
its maximal value / = 1, since the information eventually 
reaches everybody. This is a similar situation of what happens 
with the spread of rumours or epidemics. Though, there is 
still the case when some neighbor of the victim has no other 
friends and therefore the information cannot spread from or to 
it. The main question now is not only to know the minimal 
time T needed for the information to reach the maximal num- 
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FIG. 10: The spread of information tlirough tfie entire school net- 
works, (a) Spreading time r and maximal spreading time Tmax as 
function of degree k. (b) Spread factor / and total affected fraction 
Fjv as a function of k. 



ber of nearest neighbors of the victim, but also to compare 
it with the maximal time Tmax needed for the information to 
achieve the maximal fraction Fm (see Eq. (|4|i) of nodes which 
are reached. 
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starting with a larger number of friends. 

As explained above the spread factor is approximatelly one 
independently of n, yielding a delta distribution P{f) ^ — 
/), while the maximal fraction Fpf increases fast for small k 
and rapidly attains a more or less constant value around Fjv ~ 
0.6. Therefore, no optimal number of friends is observed. 

Figure [TTIshows what happens in the BA case. As one sees 
from Fig. [TTk . both t and Tmax decrease with m. Further, 
for both quantities, r (black symbols) and Tmax (white sym- 
bols), a fast convergence to a logarithmic dependence on k is 
observed when k increases. Interestingly, while the slope as a 
function of log k differs between r and Tmax, in each case it 
is approximately independent of to, being apparently a feature 
of the scale-free topology. 

In this situation one has always f ~ 1. As for F/v, very 
large values are now observed (F/v > 0.7) independently of 
k and F/v increases very fast attaining Fjv ~ 1 for fc > 10 
neighbors (see Fig.fTTb). In other words, on BA networks, in 
order that all neighbors of a certain victim get the information, 
it must spread throughout the entire network. 
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FIG. 12: Propagation of information on an Apollonian network with 
71 = 8 generations: (a) Minimal time r and maximal time Tmax and 
(b) the fraction Fjv between the total number of nodes which are 
reached by the information and the total number A'^ of nodes, both as 
functions of k. Here, P{t) cx P{Tmax) oc P{k) oc k ' (see text). 



FIG. 1 1 : The propagation of information throughout an entire B A 
network, (a) the spreading time r and maximal spreading time Tmax 
as a function of the degree k for m = 3 (circles), m = 5 (squares) 
and m — 9 (triangles). The total fraction Fm of nodes that get the 
information is plotted in (b). In all cases, / = 1 always (see text). 
Here A'^ = 10^, averages over 100 realizations were considered, and 
logarithmic binning in k was used. 

For the school networks, the behavior is illustrated in 
Fig. [To] From Fig. [TOb one sees that the behavior of t is al- 
most the same as in Fig.fS^. The maximal time decreases with 
k before attaining an approximatelly constant value. The large 
fluctuation for fc > 25 is due to poor statistics. The decrease 
of T,nax for Small k occurs, since for victims with less friends 
the successive neighborhoods through which the information 
spreads comprehend a smaller amount of neighbors than when 



Figure [12] illustrates the case of the Apollonian network. 
The value of Tmax > T increases more slowly with fc, being 
both quantities equal for very large fc values. This similarity 
between both spreading times r ^ T,nax is in fact another ev- 
idence for the fact that in order to enable the information to 
reach all neighbors it must spread throughout the entire net- 
work. In fact, from Fig. [T2b one also sees that in the range 
where Tmax > T, Fjy < 1, being equal to one only in the 
range r = Tmax- 

Finally, we examine the case of small-world networks il- 
lustrated in Fig. [13] From Fig. [T3b one sees that the spread- 
ing time T increases almost linearly with the rewiring prob- 
ability p except at the end for large values of p (random 
network). The maximal spreading time Tmax is very large 
for low rewiring probabilities, due to a large average path 
length, and decreases one order of magnitude in the range 
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FIG. 13: Information propagation in small- world networks when it 
spreads over the entire network, (a) Spreading time r and maximal 
time Tmax and (b) total fraction Fm as a function of the rewiring 
probability p. Here A'^ = 10'^ nodes and averages over 100 realiza- 
tions were considered. 



—2 < logp < —1 corresponding to small-world networks. In 
fact, Tmax follows the dependence of the average path length 
on p. 

As for the total fraction i^jv illustrated in Fig.fTSb one finds 
the opposite dependence on p than the one found for Tmax- 
for low (large) values of p one finds low (large) values of F/v, 
and a pronounced increase is observed throughout the entire 
small-world regime. To explain this behavior one must use 
both the average path length and the clustering coefficient, 
L/Lq and C/Cq shown in Fig. |7^. For random networks 
(p = 1) the total fraction attains = 1 very fast due to 
the very short average path length. For small values of p, al- 
though regular networks have an average path length that is 
larger than in random networks, the spreading time needed to 
attain F^ = 1 is now proportional to i. In the small- world 
regime however, the average path length is small but the way 
the neighbors are connected isolates in some few cases nodes 
from the information spreading process. So, although small- 
world networks have large cluster coefficients as in regular 
networks, the long-range connections change significantly the 
local topology of a given node-neighborhood. 



IV. INTRODUCING A TRANSMISSION PROBABILITY 

In all the previous results each friend will surely spread the 
gossip further Fortunately people are on average not as nasty 
as that. One should expect that only a certain fraction q < I 
of our friends are not worth to be trusted. In this Section we 
address this more realistic situation. 

Since we do not have any sociological information about 
the topological features of the 'good' friends we introduce q 
as a probability that a node has to spread the gossip. For the 
particular case q = I one reduces to the situations studied 
previously. 

Two possible ways of propagation may then occur One 



FIG. 14: Information or gossip propagation among tirst neighbors 
with probability g on a real friendship network of American stu- 
dents lUsIl averaged over 84 schools. In (a) we show the spreading 
time T and in (b) the spread factor /, both as function of degree k. 
The insets show the same data for the BA network with m — 9 and 
A'^ — 1000. At each time-step each node which knows the gossip 
tries to spread it. In all plots one has q = (o), q = 0.2 (•), q — 0.4 
(□), q = 0.6 (■), q = 0.8 (A) and g = 1 (A). 



concerns a scenario where friendships connections are related 
to contacts between the nodes at a given instant. In this situ- 
ation a certain individual tries only once, with probability q, 
to spread the information to its friends. Therefore, if the gos- 
sip is not 'accepted' once it will never be. Another scenario 
is of course when the spread is tried repeatedly at each time- 
step. We will start with this latter scenario and end with the 
more pleasant one where gossip is only able to spread from 
the nodes which heard it most recently. 

Introducing the new parameter q in the model we go back 
to the first information spreading model studied in Section Hll 
where the gossip only spreads to friends of the victim. At each 
time-step the neighbors which already know the gossip repeat- 
edly try to spread it to other friends of the victim. Therefore, 
one expects to attain the same value of / that one measured 
for (7 = 1, but this time only after a larger spreading time, 
namely t' = r/g. Figure [T4l shows the result of such infor- 
mation propagation regime for the school networks, and for 
several values of q. The corresponding curves of / are plotted 
inFig.[T4b. 

Of course for g = the spreading time is always r = 
and the spread factor equals / = 1/fc since only the node 
starting the gossip will know it. As expected, for all other 
values the spread factor coincides with the one for q = 1, 
while the spreading time preserves its logarithmic dependence 
on k for large degrees, and the exponent increases with 1 / g,as 
explained below. 

In the insets of both plots in Fig. [14] we show for compari- 
son the spreading time t and spread factor / for a BA network 
with N = 1000 and m = 9. A strong deviation from the log- 
arithmic dependence of the spreading time is observed, due to 
the high number of initial outgoing connections (m — 9). 
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FIG. 15: Gossip propagation in an Apollonian network with n = 8 
generations, for 10 values of probability g = 0, 0.1, 0.2, . . . , 0.9 
and 1. The slope B of the dashed lines which fit the data de- 
creases with q as shown in the inset where the line yields B — 
exp (0.23 - 0.88 log g) ~ 1/g (see text). 



closely a hyperbolic behavior, B 
written more generally as 



1 /q. Thus, Eq. ([T]l can be 



(5) 



Finally, we can also assume that the person to which a gos- 
sip did not spread at the first attempt, will never get it. In 
this way, the gossip is a quantity which percolates through the 
system. 

In Fig.[T6]we see the behavior of r and / for different values 
of q for the school networks and in the inset for the BA net- 
work. When the spreading probability q decreases, the min- 
imum in / first shifts to larger k and finally disappears. The 
asymptotic logarithmic law of r for large k remains for all 
probabilities q. As in previous cases, the BA network has a 
similar behavior as the school friendships. The Apollonian 
network, however, behaves quite differently: r first increases 
with q and then eventually falls off to zero so that there exists 
a special value qmax ~ 0.75 for which the spreading time r is 
maximized. 



V. DISCUSSION AND CONCLUSIONS 
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FIG. 16: Propagation of gossip among first neighbors on a real 
friendship network of American students ITsIl , when persons to 
which a gossip did not spread at the first attempt will never get it. 
In (a) we show the spreading time r and in (b) the spread factor /, 
both as a function of degree k. The insets show the same data for the 
BA network with m = 9 and A'^ — 1000. After knowing the gossip 
each node tries to spread it only once (see text). In all plots one has 
g = (o), g = 0.25 (•), g = 0.5 (□), g = 0.75 (■) and g = 1 (A). 



The logarithmic dependence of the spreading time can be 
more easily seen when studying the Apollonian network as 
shown in Fig. [15] Here we plot the spreading time for 10 
different values of q and fit all of them with a logarithmic 
function as the one in Eq. ([T]i. The corresponding slope B as 
a function of q is plotted in the inset of Fig. [15] and follows 



In this paper, we studied a general model of information 
spreading suited for different kinds of social information. In 
the usual case of rumour or opinion propagation the informa- 
tion spreads throughout the network, and all nodes are equally 
capable of transmiting the information to their neighbors. Two 
measures were proposed to characterize the spreading of such 
model, namely, the spreading factor measuring the accessi- 
ble neighborhood around each node which can be reached by 
the information spreading, and the spreading time which com- 
putes the minimum time to reach such neighborhood. 

Further, we have shown that by computing these quantities 
for each node the resulting distributions give additional insight 
to the underlying network structure on which the spreading 
takes place. More precisely, the magnitude of the skewness 
of the distribution of the spreading factor gives a measure of 
how difficult it is to access one neighbor, starting from another 
one. For positive values of the skewness, most of the pairs of 
neighbors are connected by some path of connections, while 
for negative values of the skewness, neighbors are more likely 
groupped in separated connected pairs. 

In the particular case that the information is about a certain 
target-node and thus is of interest to a restricted neighbor- 
hood around it, one yields a minimal model to study gossip 
spreading. Applying such a scheme to artifical and empirical 
networks, we found that, although different in their statistical 
properties, information on empirical social networks seems to 
spread similarly to what is observed in scale-free networks. 
In both cases, the spreading time shows a logarithmic depen- 
dence on the degree, indicating small-world effect within the 
nearest neighborhood of the nodes. Further, from the compu- 
tation of the spreading factor we observed that there is a non- 
trivial optimal number of friends which minimizes the danger 
of being gossipped that depends on the size of the network 
and on total number of acquaintances in it. We also showed 
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that this optimal value is characteristic of either scale-free net- 
works or real social networks, but is not observed in small- 
world networks, rising the question of what network proper- 
ties may give rise to the emergence of such an optimal value. 

However, when the information spreads beyond the near- 
est neighbors, in a similar way as for propagation of rumours 
and epidemics, this optimal value disappears with the spread- 
ing factor rapidly converging to / = 1. Also the logarithmic 
dependence of the spreading time no longer holds in this case. 

Since one person does not in general spread information to 
all its neighbors, neither at the same time nor with complete 
certainty, we also studied regimes of information propagation 
where the spreading from one node to another occurs with 
some probability q. 

Due to their particular features and assumptions, our con- 
cepts and measures to address the propagation of information 
in networks could be suited to other situations. For instance. 



in the case of the Internet, some trojan horses need to connect 
to a specific host to download some data in order to become 
effective. For them the spread factor should be a good mea- 
sure to assess the vulnerability to the spreading of this virus 
attack. In this situation probably an experimental test of the 
emergence of the optimal degree found in the cases stated here 
could be easier to be implemented. 
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